Picture for Jaeyeon Kim

Jaeyeon Kim

K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts

Add code
Jun 01, 2026
Viaarxiv icon

Discrete Tilt Matching

Add code
Apr 20, 2026
Viaarxiv icon

The Interspeech 2026 Audio Reasoning Challenge: Evaluating Reasoning Process Quality for Audio Reasoning Models and Agents

Add code
Feb 15, 2026
Viaarxiv icon

Stop Training for the Worst: Progressive Unmasking Accelerates Masked Diffusion Training

Add code
Feb 10, 2026
Viaarxiv icon

Fine-Tuning Masked Diffusion for Provable Self-Correction

Add code
Oct 01, 2025
Viaarxiv icon

Selective Underfitting in Diffusion Models

Add code
Oct 01, 2025
Viaarxiv icon

ERGO: Efficient High-Resolution Visual Understanding for Vision-Language Models

Add code
Sep 26, 2025
Viaarxiv icon

WoW-Bench: Evaluating Fine-Grained Acoustic Perception in Audio-Language Models via Marine Mammal Vocalizations

Add code
Aug 28, 2025
Figure 1 for WoW-Bench: Evaluating Fine-Grained Acoustic Perception in Audio-Language Models via Marine Mammal Vocalizations
Figure 2 for WoW-Bench: Evaluating Fine-Grained Acoustic Perception in Audio-Language Models via Marine Mammal Vocalizations
Figure 3 for WoW-Bench: Evaluating Fine-Grained Acoustic Perception in Audio-Language Models via Marine Mammal Vocalizations
Figure 4 for WoW-Bench: Evaluating Fine-Grained Acoustic Perception in Audio-Language Models via Marine Mammal Vocalizations
Viaarxiv icon

ViSAGe: Video-to-Spatial Audio Generation

Add code
Jun 13, 2025
Figure 1 for ViSAGe: Video-to-Spatial Audio Generation
Figure 2 for ViSAGe: Video-to-Spatial Audio Generation
Figure 3 for ViSAGe: Video-to-Spatial Audio Generation
Figure 4 for ViSAGe: Video-to-Spatial Audio Generation
Viaarxiv icon

Multi-Domain Audio Question Answering Toward Acoustic Content Reasoning in The DCASE 2025 Challenge

Add code
May 12, 2025
Viaarxiv icon